Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 2771097 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 253.7 MiB |
| Average record size in memory | 96.0 B |
Variable types
| Numeric | 12 |
|---|
df_index is highly correlated with u and 9 other fields | High correlation |
u is highly correlated with df_index and 9 other fields | High correlation |
g is highly correlated with df_index and 9 other fields | High correlation |
r is highly correlated with df_index and 9 other fields | High correlation |
i is highly correlated with df_index and 9 other fields | High correlation |
z is highly correlated with df_index and 9 other fields | High correlation |
uErr is highly correlated with df_index and 9 other fields | High correlation |
gErr is highly correlated with df_index and 9 other fields | High correlation |
rErr is highly correlated with df_index and 9 other fields | High correlation |
iErr is highly correlated with df_index and 9 other fields | High correlation |
zErr is highly correlated with df_index and 9 other fields | High correlation |
df_index is highly correlated with u and 6 other fields | High correlation |
u is highly correlated with df_index and 5 other fields | High correlation |
g is highly correlated with df_index and 7 other fields | High correlation |
r is highly correlated with df_index and 7 other fields | High correlation |
i is highly correlated with df_index and 6 other fields | High correlation |
z is highly correlated with df_index and 6 other fields | High correlation |
uErr is highly correlated with u and 2 other fields | High correlation |
gErr is highly correlated with df_index and 4 other fields | High correlation |
rErr is highly correlated with df_index and 5 other fields | High correlation |
zErr is highly correlated with z | High correlation |
df_index is highly correlated with g and 7 other fields | High correlation |
u is highly correlated with g and 4 other fields | High correlation |
g is highly correlated with df_index and 9 other fields | High correlation |
r is highly correlated with df_index and 8 other fields | High correlation |
i is highly correlated with df_index and 7 other fields | High correlation |
z is highly correlated with df_index and 7 other fields | High correlation |
uErr is highly correlated with u and 2 other fields | High correlation |
gErr is highly correlated with df_index and 9 other fields | High correlation |
rErr is highly correlated with df_index and 8 other fields | High correlation |
iErr is highly correlated with df_index and 7 other fields | High correlation |
zErr is highly correlated with df_index and 7 other fields | High correlation |
df_index is highly correlated with u and 4 other fields | High correlation |
u is highly correlated with df_index and 4 other fields | High correlation |
g is highly correlated with df_index and 4 other fields | High correlation |
r is highly correlated with df_index and 4 other fields | High correlation |
i is highly correlated with df_index and 4 other fields | High correlation |
z is highly correlated with df_index and 4 other fields | High correlation |
uErr is highly correlated with gErr | High correlation |
gErr is highly correlated with uErr and 1 other fields | High correlation |
zErr is highly correlated with gErr | High correlation |
iErr is highly skewed (γ1 = 215.8193612) | Skewed |
zErr is highly skewed (γ1 = 126.8038418) | Skewed |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
ID has unique values | Unique |
Reproduction
| Analysis started | 2022-02-24 04:35:07.478487 |
|---|---|
| Analysis finished | 2022-02-24 04:39:09.440038 |
| Duration | 4 minutes and 1.96 second |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 2771097 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1385598.454 |
| Minimum | 0 |
|---|---|
| Maximum | 2771158 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 138573.8 |
| Q1 | 692824 |
| median | 1385606 |
| Q3 | 2078381 |
| 95-th percentile | 2632602.2 |
| Maximum | 2771158 |
| Range | 2771158 |
| Interquartile range (IQR) | 1385557 |
Descriptive statistics
| Standard deviation | 799958.2458 |
|---|---|
| Coefficient of variation (CV) | 0.5773377152 |
| Kurtosis | -1.199986471 |
| Mean | 1385598.454 |
| Median Absolute Deviation (MAD) | 692779 |
| Skewness | -2.287114509 × 10-5 |
| Sum | 3.83962772 × 1012 |
| Variance | 6.39933195 × 1011 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1847459 | 1 | < 0.1% |
| 1847451 | 1 | < 0.1% |
| 1847452 | 1 | < 0.1% |
| 1847453 | 1 | < 0.1% |
| 1847454 | 1 | < 0.1% |
| 1847455 | 1 | < 0.1% |
| 1847456 | 1 | < 0.1% |
| 1847457 | 1 | < 0.1% |
| 1847458 | 1 | < 0.1% |
| Other values (2771087) | 2771087 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 2771158 | 1 | |
| 2771157 | 1 | |
| 2771156 | 1 | |
| 2771155 | 1 | |
| 2771154 | 1 | |
| 2771153 | 1 | |
| 2771152 | 1 | |
| 2771151 | 1 | |
| 2771150 | 1 | |
| 2771149 | 1 |
| Distinct | 2771097 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.237664767 × 1018 |
| Minimum | 1.23764588 × 1018 |
|---|---|
| Maximum | 1.237680531 × 1018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 1.23764588 × 1018 |
|---|---|
| 5-th percentile | 1.237651538 × 1018 |
| Q1 | 1.237658613 × 1018 |
| median | 1.237663784 × 1018 |
| Q3 | 1.237668298 × 1018 |
| 95-th percentile | 1.237679541 × 1018 |
| Maximum | 1.237680531 × 1018 |
| Range | 3.465180558 × 1013 |
| Interquartile range (IQR) | 9.685164491 × 1012 |
Descriptive statistics
| Standard deviation | 8.395134131 × 1012 |
|---|---|
| Coefficient of variation (CV) | 6.783043642 × 10-6 |
| Kurtosis | -0.5909072918 |
| Mean | 1.237664767 × 1018 |
| Median Absolute Deviation (MAD) | 4.663256416 × 1012 |
| Skewness | 0.3696778269 |
| Sum | -3.321424075 × 1018 |
| Variance | 7.047827708 × 1025 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.237671129 × 1018 | 1 | < 0.1% |
| 1.237661434 × 1018 | 1 | < 0.1% |
| 1.237671143 × 1018 | 1 | < 0.1% |
| 1.237678879 × 1018 | 1 | < 0.1% |
| 1.237663916 × 1018 | 1 | < 0.1% |
| 1.23765125 × 1018 | 1 | < 0.1% |
| 1.237664294 × 1018 | 1 | < 0.1% |
| 1.237678663 × 1018 | 1 | < 0.1% |
| 1.237665329 × 1018 | 1 | < 0.1% |
| 1.237679323 × 1018 | 1 | < 0.1% |
| Other values (2771087) | 2771087 |
| Value | Count | Frequency (%) |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 | |
| 1.23764588 × 1018 | 1 |
| Value | Count | Frequency (%) |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 | |
| 1.237680531 × 1018 | 1 |
| Distinct | 862294 |
|---|---|
| Distinct (%) | 31.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.54721891 |
| Minimum | 7.918076 |
|---|---|
| Maximum | 33.45042 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 7.918076 |
|---|---|
| 5-th percentile | 18.744108 |
| Q1 | 20.68293 |
| median | 22.81069 |
| Q3 | 24.18341 |
| 95-th percentile | 26.06424 |
| Maximum | 33.45042 |
| Range | 25.532344 |
| Interquartile range (IQR) | 3.50048 |
Descriptive statistics
| Standard deviation | 2.279702953 |
|---|---|
| Coefficient of variation (CV) | 0.1011079443 |
| Kurtosis | -0.6206763375 |
| Mean | 22.54721891 |
| Median Absolute Deviation (MAD) | 1.62961 |
| Skewness | -0.2604099229 |
| Sum | 62480530.68 |
| Variance | 5.197045552 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 24.63466 | 363 | < 0.1% |
| 24.63467 | 315 | < 0.1% |
| 24.63465 | 259 | < 0.1% |
| 24.63468 | 224 | < 0.1% |
| 24.63469 | 101 | < 0.1% |
| 24.6347 | 73 | < 0.1% |
| 24.63464 | 71 | < 0.1% |
| 24.63471 | 32 | < 0.1% |
| 24.63463 | 24 | < 0.1% |
| 24.63472 | 22 | < 0.1% |
| Other values (862284) | 2769613 |
| Value | Count | Frequency (%) |
| 7.918076 | 1 | |
| 9.866534 | 1 | |
| 9.941942 | 1 | |
| 10.17025 | 1 | |
| 10.22717 | 1 | |
| 10.48895 | 1 | |
| 10.54181 | 1 | |
| 11.0643 | 1 | |
| 11.21397 | 1 | |
| 11.41754 | 1 |
| Value | Count | Frequency (%) |
| 33.45042 | 1 | |
| 32.66663 | 1 | |
| 31.77132 | 1 | |
| 31.38664 | 1 | |
| 31.10792 | 1 | |
| 30.96 | 1 | |
| 30.81344 | 1 | |
| 30.77991 | 1 | |
| 30.77358 | 1 | |
| 30.76049 | 1 |
| Distinct | 771107 |
|---|---|
| Distinct (%) | 27.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.85631799 |
| Minimum | 7.466997 |
|---|---|
| Maximum | 33.72469 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 7.466997 |
|---|---|
| 5-th percentile | 17.24249 |
| Q1 | 18.798 |
| median | 21.5492 |
| Q3 | 22.42596 |
| 95-th percentile | 23.561882 |
| Maximum | 33.72469 |
| Range | 26.257693 |
| Interquartile range (IQR) | 3.62796 |
Descriptive statistics
| Standard deviation | 2.120288897 |
|---|---|
| Coefficient of variation (CV) | 0.1016617074 |
| Kurtosis | -0.6362952592 |
| Mean | 20.85631799 |
| Median Absolute Deviation (MAD) | 1.22676 |
| Skewness | -0.5167377968 |
| Sum | 57794880.23 |
| Variance | 4.495625009 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 25.11438 | 108 | < 0.1% |
| 25.11437 | 66 | < 0.1% |
| 25.11439 | 57 | < 0.1% |
| 25.1144 | 45 | < 0.1% |
| 22.2569 | 25 | < 0.1% |
| 22.14977 | 25 | < 0.1% |
| 22.03572 | 24 | < 0.1% |
| 22.23071 | 23 | < 0.1% |
| 22.11493 | 23 | < 0.1% |
| 22.19956 | 23 | < 0.1% |
| Other values (771097) | 2770678 |
| Value | Count | Frequency (%) |
| 7.466997 | 1 | |
| 9.897096 | 1 | |
| 10.24659 | 1 | |
| 10.32773 | 1 | |
| 10.40744 | 1 | |
| 10.53339 | 1 | |
| 10.64063 | 1 | |
| 10.75463 | 1 | |
| 11.10078 | 1 | |
| 11.15687 | 1 |
| Value | Count | Frequency (%) |
| 33.72469 | 1 | |
| 32.90944 | 1 | |
| 32.14997 | 1 | |
| 31.67036 | 1 | |
| 31.60224 | 1 | |
| 31.52315 | 1 | |
| 31.35417 | 1 | |
| 31.32736 | 1 | |
| 31.06618 | 1 | |
| 30.95307 | 1 |
| Distinct | 701819 |
|---|---|
| Distinct (%) | 25.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.52907862 |
| Minimum | 8.902843 |
|---|---|
| Maximum | 22.99995 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 8.902843 |
|---|---|
| 5-th percentile | 16.43614 |
| Q1 | 17.74942 |
| median | 20.05909 |
| Q3 | 20.93971 |
| 95-th percentile | 22.10703 |
| Maximum | 22.99995 |
| Range | 14.097107 |
| Interquartile range (IQR) | 3.19029 |
Descriptive statistics
| Standard deviation | 1.868157141 |
|---|---|
| Coefficient of variation (CV) | 0.09566028066 |
| Kurtosis | -0.7443979195 |
| Mean | 19.52907862 |
| Median Absolute Deviation (MAD) | 1.34901 |
| Skewness | -0.4281007916 |
| Sum | 54116971.16 |
| Variance | 3.490011105 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 20.5507 | 23 | < 0.1% |
| 20.29045 | 22 | < 0.1% |
| 20.52563 | 22 | < 0.1% |
| 20.50425 | 22 | < 0.1% |
| 20.78461 | 22 | < 0.1% |
| 20.61646 | 22 | < 0.1% |
| 20.44975 | 22 | < 0.1% |
| 20.71745 | 22 | < 0.1% |
| 20.43664 | 22 | < 0.1% |
| 20.63658 | 22 | < 0.1% |
| Other values (701809) | 2770876 |
| Value | Count | Frequency (%) |
| 8.902843 | 1 | |
| 9.474476 | 1 | |
| 9.501574 | 1 | |
| 9.848258 | 1 | |
| 9.903746 | 1 | |
| 9.920951 | 1 | |
| 10.04462 | 1 | |
| 10.07247 | 1 | |
| 10.10956 | 1 | |
| 10.13125 | 1 |
| Value | Count | Frequency (%) |
| 22.99995 | 1 | |
| 22.99994 | 1 | |
| 22.99993 | 2 | |
| 22.99991 | 1 | |
| 22.9999 | 1 | |
| 22.99981 | 2 | |
| 22.9998 | 1 | |
| 22.99973 | 1 | |
| 22.99971 | 1 | |
| 22.99968 | 1 |
| Distinct | 684990 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.7975133 |
| Minimum | 8.364965 |
|---|---|
| Maximum | 31.65274 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 8.364965 |
|---|---|
| 5-th percentile | 16.020778 |
| Q1 | 17.3165 |
| median | 19.17881 |
| Q3 | 19.91382 |
| 95-th percentile | 21.4056 |
| Maximum | 31.65274 |
| Range | 23.287775 |
| Interquartile range (IQR) | 2.59732 |
Descriptive statistics
| Standard deviation | 1.682664926 |
|---|---|
| Coefficient of variation (CV) | 0.0895152938 |
| Kurtosis | -0.3469783265 |
| Mean | 18.7975133 |
| Median Absolute Deviation (MAD) | 1.14515 |
| Skewness | -0.3030799236 |
| Sum | 52089732.72 |
| Variance | 2.831361252 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 24.3618 | 78 | < 0.1% |
| 24.36181 | 47 | < 0.1% |
| 19.74928 | 27 | < 0.1% |
| 19.523 | 26 | < 0.1% |
| 19.66745 | 26 | < 0.1% |
| 19.61283 | 26 | < 0.1% |
| 19.6445 | 25 | < 0.1% |
| 19.54819 | 25 | < 0.1% |
| 19.56573 | 25 | < 0.1% |
| 19.69037 | 25 | < 0.1% |
| Other values (684980) | 2770767 |
| Value | Count | Frequency (%) |
| 8.364965 | 1 | |
| 8.411285 | 1 | |
| 9.370869 | 1 | |
| 9.527082 | 1 | |
| 9.550007 | 1 | |
| 9.568777 | 1 | |
| 9.75404 | 1 | |
| 9.85052 | 1 | |
| 9.886005 | 1 | |
| 9.930574 | 1 |
| Value | Count | Frequency (%) |
| 31.65274 | 1 | |
| 31.23163 | 1 | |
| 31.14982 | 1 | |
| 31.07335 | 1 | |
| 30.84629 | 1 | |
| 30.83151 | 1 | |
| 30.71105 | 1 | |
| 30.65209 | 1 | |
| 30.57623 | 1 | |
| 30.49861 | 1 |
| Distinct | 683963 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.39512669 |
| Minimum | 6.485586 |
|---|---|
| Maximum | 30.01704 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 6.485586 |
|---|---|
| 5-th percentile | 15.70244 |
| Q1 | 17.02607 |
| median | 18.72149 |
| Q3 | 19.44233 |
| 95-th percentile | 21.12195 |
| Maximum | 30.01704 |
| Range | 23.531454 |
| Interquartile range (IQR) | 2.41626 |
Descriptive statistics
| Standard deviation | 1.646698893 |
|---|---|
| Coefficient of variation (CV) | 0.08951821429 |
| Kurtosis | 0.02160338546 |
| Mean | 18.39512669 |
| Median Absolute Deviation (MAD) | 1.01411 |
| Skewness | -0.1468942164 |
| Sum | 50974680.37 |
| Variance | 2.711617243 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 22.8269 | 569 | < 0.1% |
| 22.82691 | 330 | < 0.1% |
| 22.82689 | 170 | < 0.1% |
| 22.82692 | 89 | < 0.1% |
| 22.82693 | 41 | < 0.1% |
| 22.82694 | 31 | < 0.1% |
| 19.17929 | 29 | < 0.1% |
| 19.21716 | 28 | < 0.1% |
| 19.25344 | 28 | < 0.1% |
| 19.15662 | 27 | < 0.1% |
| Other values (683953) | 2769755 |
| Value | Count | Frequency (%) |
| 6.485586 | 1 | |
| 6.972855 | 1 | |
| 7.140388 | 1 | |
| 7.281719 | 1 | |
| 9.563122 | 1 | |
| 9.673311 | 1 | |
| 9.759146 | 1 | |
| 9.771158 | 1 | |
| 9.971849 | 1 | |
| 9.987154 | 1 |
| Value | Count | Frequency (%) |
| 30.01704 | 1 | |
| 29.38374 | 1 | |
| 29.26314 | 1 | |
| 29.18374 | 1 | |
| 29.17408 | 1 | |
| 29.04169 | 1 | |
| 29.03935 | 1 | |
| 29.0194 | 1 | |
| 28.96626 | 1 | |
| 28.91793 | 1 |
| Distinct | 2426278 |
|---|---|
| Distinct (%) | 87.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6795658449 |
| Minimum | 0.0001527219 |
|---|---|
| Maximum | 68.59662 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 0.0001527219 |
|---|---|
| 5-th percentile | 0.034038648 |
| Q1 | 0.1267589 |
| median | 0.5586478 |
| Q3 | 1.03992 |
| 95-th percentile | 1.7884246 |
| Maximum | 68.59662 |
| Range | 68.59646728 |
| Interquartile range (IQR) | 0.9131611 |
Descriptive statistics
| Standard deviation | 0.6173188602 |
|---|---|
| Coefficient of variation (CV) | 0.9084018346 |
| Kurtosis | 128.3319909 |
| Mean | 0.6795658449 |
| Median Absolute Deviation (MAD) | 0.4472315 |
| Skewness | 3.055100909 |
| Sum | 1883142.874 |
| Variance | 0.3810825751 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.20849 | 8 | < 0.1% |
| 1.176417 | 8 | < 0.1% |
| 1.151895 | 8 | < 0.1% |
| 1.040624 | 8 | < 0.1% |
| 1.069763 | 8 | < 0.1% |
| 1.136649 | 8 | < 0.1% |
| 1.018593 | 8 | < 0.1% |
| 1.043883 | 7 | < 0.1% |
| 1.330009 | 7 | < 0.1% |
| 1.077335 | 7 | < 0.1% |
| Other values (2426268) | 2771020 |
| Value | Count | Frequency (%) |
| 0.0001527219 | 1 | |
| 0.0002531087 | 1 | |
| 0.0005957853 | 1 | |
| 0.0006323291 | 1 | |
| 0.001352592 | 1 | |
| 0.001526264 | 1 | |
| 0.002129648 | 1 | |
| 0.002286536 | 1 | |
| 0.002411008 | 1 | |
| 0.002435118 | 1 |
| Value | Count | Frequency (%) |
| 68.59662 | 1 | |
| 55.50948 | 1 | |
| 46.49884 | 1 | |
| 40.79873 | 1 | |
| 39.44621 | 1 | |
| 38.20195 | 1 | |
| 36.87848 | 1 | |
| 30.9955 | 1 | |
| 28.22746 | 1 | |
| 27.37109 | 1 |
| Distinct | 2400035 |
|---|---|
| Distinct (%) | 86.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1352228852 |
| Minimum | 0.0002041792 |
|---|---|
| Maximum | 41.14543 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 0.0002041792 |
|---|---|
| 5-th percentile | 0.0055366346 |
| Q1 | 0.01317563 |
| median | 0.09019641 |
| Q3 | 0.1729824 |
| 95-th percentile | 0.45140432 |
| Maximum | 41.14543 |
| Range | 41.14522582 |
| Interquartile range (IQR) | 0.15980677 |
Descriptive statistics
| Standard deviation | 0.1912511815 |
|---|---|
| Coefficient of variation (CV) | 1.41434034 |
| Kurtosis | 1938.599197 |
| Mean | 0.1352228852 |
| Median Absolute Deviation (MAD) | 0.0779491 |
| Skewness | 14.75855721 |
| Sum | 374715.7316 |
| Variance | 0.03657701441 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.1326784 | 7 | < 0.1% |
| 0.1179987 | 7 | < 0.1% |
| 0.1224872 | 7 | < 0.1% |
| 0.1574412 | 7 | < 0.1% |
| 0.128186 | 7 | < 0.1% |
| 0.1223937 | 7 | < 0.1% |
| 0.1282656 | 7 | < 0.1% |
| 0.1170574 | 7 | < 0.1% |
| 0.1458281 | 7 | < 0.1% |
| 0.1400094 | 7 | < 0.1% |
| Other values (2400025) | 2771027 |
| Value | Count | Frequency (%) |
| 0.0002041792 | 1 | |
| 0.000344248 | 1 | |
| 0.0003744256 | 1 | |
| 0.0004111342 | 1 | |
| 0.0004521218 | 1 | |
| 0.0005116757 | 1 | |
| 0.0005221947 | 1 | |
| 0.0005407966 | 1 | |
| 0.000569647 | 1 | |
| 0.0006196198 | 1 |
| Value | Count | Frequency (%) |
| 41.14543 | 1 | |
| 40.80777 | 1 | |
| 27.87659 | 1 | |
| 25.20317 | 1 | |
| 18.66603 | 1 | |
| 16.81018 | 1 | |
| 14.3314 | 1 | |
| 13.89603 | 1 | |
| 13.56933 | 1 | |
| 12.59677 | 1 |
| Distinct | 2512717 |
|---|---|
| Distinct (%) | 90.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05601202525 |
| Minimum | 0.0002880982 |
|---|---|
| Maximum | 14.52429 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 0.0002880982 |
|---|---|
| 5-th percentile | 0.004357691 |
| Q1 | 0.008711451 |
| median | 0.03777351 |
| Q3 | 0.07395164 |
| 95-th percentile | 0.1807854 |
| Maximum | 14.52429 |
| Range | 14.5240019 |
| Interquartile range (IQR) | 0.065240189 |
Descriptive statistics
| Standard deviation | 0.06815684939 |
|---|---|
| Coefficient of variation (CV) | 1.216825299 |
| Kurtosis | 1299.620031 |
| Mean | 0.05601202525 |
| Median Absolute Deviation (MAD) | 0.029997709 |
| Skewness | 11.73245242 |
| Sum | 155214.7551 |
| Variance | 0.004645356119 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.1017352 | 6 | < 0.1% |
| 0.1031431 | 6 | < 0.1% |
| 0.1349871 | 6 | < 0.1% |
| 0.1084829 | 6 | < 0.1% |
| 0.1029735 | 6 | < 0.1% |
| 0.1278086 | 6 | < 0.1% |
| 0.1030422 | 6 | < 0.1% |
| 0.04695546 | 6 | < 0.1% |
| 0.1579721 | 6 | < 0.1% |
| 0.1009113 | 6 | < 0.1% |
| Other values (2512707) | 2771037 |
| Value | Count | Frequency (%) |
| 0.0002880982 | 1 | |
| 0.0002945296 | 1 | |
| 0.0003206228 | 1 | |
| 0.0003519821 | 1 | |
| 0.0003707175 | 1 | |
| 0.0003707407 | 1 | |
| 0.0003747093 | 1 | |
| 0.0003870378 | 1 | |
| 0.0004120384 | 1 | |
| 0.0004849193 | 1 |
| Value | Count | Frequency (%) |
| 14.52429 | 1 | |
| 9.04069 | 1 | |
| 8.556941 | 1 | |
| 8.420018 | 1 | |
| 7.712391 | 1 | |
| 7.497026 | 1 | |
| 7.276915 | 1 | |
| 5.528741 | 1 | |
| 5.501844 | 1 | |
| 5.124316 | 1 |
| Distinct | 2458972 |
|---|---|
| Distinct (%) | 88.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04065202339 |
| Minimum | 4.2032 × 10-6 |
|---|---|
| Maximum | 55.15096 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 4.2032 × 10-6 |
|---|---|
| 5-th percentile | 0.004368665 |
| Q1 | 0.008751572 |
| median | 0.02717371 |
| Q3 | 0.04588228 |
| 95-th percentile | 0.13778706 |
| Maximum | 55.15096 |
| Range | 55.1509558 |
| Interquartile range (IQR) | 0.037130708 |
Descriptive statistics
| Standard deviation | 0.09732236877 |
|---|---|
| Coefficient of variation (CV) | 2.394035048 |
| Kurtosis | 97945.76804 |
| Mean | 0.04065202339 |
| Median Absolute Deviation (MAD) | 0.018491108 |
| Skewness | 215.8193612 |
| Sum | 112650.7001 |
| Variance | 0.009471643463 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.0293873 | 7 | < 0.1% |
| 0.01049611 | 6 | < 0.1% |
| 0.01046309 | 6 | < 0.1% |
| 0.03408777 | 6 | < 0.1% |
| 0.03017065 | 6 | < 0.1% |
| 0.01889226 | 6 | < 0.1% |
| 0.01099824 | 6 | < 0.1% |
| 0.03648433 | 6 | < 0.1% |
| 0.01132452 | 6 | < 0.1% |
| 0.1069786 | 6 | < 0.1% |
| Other values (2458962) | 2771036 |
| Value | Count | Frequency (%) |
| 4.2032 × 10-6 | 1 | |
| 0.0002416363 | 1 | |
| 0.0002654212 | 1 | |
| 0.0002655984 | 1 | |
| 0.0003338849 | 1 | |
| 0.0003575227 | 1 | |
| 0.0003629721 | 1 | |
| 0.0004060516 | 1 | |
| 0.0004348774 | 1 | |
| 0.0004430136 | 1 |
| Value | Count | Frequency (%) |
| 55.15096 | 1 | |
| 52.3215 | 1 | |
| 48.50616 | 1 | |
| 28.0332 | 1 | |
| 26.41502 | 1 | |
| 20.70949 | 1 | |
| 19.05205 | 1 | |
| 16.70047 | 1 | |
| 16.66532 | 1 | |
| 14.79383 | 1 |
| Distinct | 2398599 |
|---|---|
| Distinct (%) | 86.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1033954568 |
| Minimum | 0.0001718631 |
|---|---|
| Maximum | 125.6025 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.1 MiB |
Quantile statistics
| Minimum | 0.0001718631 |
|---|---|
| 5-th percentile | 0.0093098 |
| Q1 | 0.0226173 |
| median | 0.06647153 |
| Q3 | 0.1130222 |
| 95-th percentile | 0.36526564 |
| Maximum | 125.6025 |
| Range | 125.6023281 |
| Interquartile range (IQR) | 0.0904049 |
Descriptive statistics
| Standard deviation | 0.2099898371 |
|---|---|
| Coefficient of variation (CV) | 2.030938724 |
| Kurtosis | 55879.59457 |
| Mean | 0.1033954568 |
| Median Absolute Deviation (MAD) | 0.04462286 |
| Skewness | 126.8038418 |
| Sum | 286518.8402 |
| Variance | 0.04409573171 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.1069008 | 9 | < 0.1% |
| 0.1070111 | 8 | < 0.1% |
| 0.10416 | 8 | < 0.1% |
| 0.1069292 | 8 | < 0.1% |
| 0.101817 | 8 | < 0.1% |
| 0.1196612 | 8 | < 0.1% |
| 0.104101 | 8 | < 0.1% |
| 0.1187577 | 8 | < 0.1% |
| 0.1038319 | 8 | < 0.1% |
| 0.1227017 | 7 | < 0.1% |
| Other values (2398589) | 2771017 |
| Value | Count | Frequency (%) |
| 0.0001718631 | 1 | |
| 0.00020425 | 1 | |
| 0.0002246803 | 1 | |
| 0.0004583807 | 1 | |
| 0.0004735135 | 1 | |
| 0.0009759665 | 1 | |
| 0.001006559 | 1 | |
| 0.001018574 | 1 | |
| 0.001079047 | 1 | |
| 0.001119909 | 1 |
| Value | Count | Frequency (%) |
| 125.6025 | 1 | |
| 70.74406 | 1 | |
| 51.20518 | 1 | |
| 48.38092 | 1 | |
| 43.06553 | 1 | |
| 38.72721 | 1 | |
| 35.62029 | 1 | |
| 33.70526 | 1 | |
| 28.62317 | 1 | |
| 28.58433 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| df_index | ID | u | g | r | i | z | uErr | gErr | rErr | iErr | zErr | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1237671129125683920 | 24.70152 | 22.30560 | 21.74474 | 21.40080 | 21.42348 | 0.781686 | 0.098980 | 0.078220 | 0.082370 | 0.240668 |
| 1 | 1 | 1237657772314394978 | 22.67397 | 21.96686 | 21.94899 | 21.72379 | 21.86105 | 0.390818 | 0.077608 | 0.128782 | 0.172128 | 0.565489 |
| 2 | 2 | 1237660764832400503 | 26.18586 | 22.41516 | 21.65361 | 21.57355 | 21.89604 | 0.371185 | 0.111313 | 0.079666 | 0.103673 | 0.496207 |
| 3 | 3 | 1237665584334570546 | 24.60471 | 21.87256 | 21.65432 | 21.64291 | 22.18519 | 0.826998 | 0.060778 | 0.065266 | 0.100126 | 0.507707 |
| 4 | 4 | 1237657190904300059 | 25.12164 | 22.79996 | 22.21146 | 21.85425 | 22.10207 | 0.682036 | 0.133950 | 0.111180 | 0.112090 | 0.480457 |
| 5 | 5 | 1237663543144874769 | 24.09210 | 22.85346 | 22.30763 | 22.04790 | 22.40283 | 0.642321 | 0.129014 | 0.124250 | 0.135843 | 0.529423 |
| 6 | 6 | 1237680503433463381 | 23.98054 | 24.24618 | 22.73013 | 21.30288 | 19.85708 | 1.701148 | 0.932739 | 0.479780 | 0.194279 | 0.188464 |
| 7 | 7 | 1237668298210279496 | 24.17533 | 26.84352 | 20.13889 | 18.72636 | 21.07554 | 2.558426 | 0.675604 | 0.061569 | 0.060317 | 0.877250 |
| 8 | 8 | 1237657877000749920 | 21.80250 | 20.75988 | 20.21546 | 20.13754 | 20.19818 | 0.138595 | 0.031938 | 0.025508 | 0.032024 | 0.116473 |
| 9 | 9 | 1237668350284333541 | 25.30149 | 21.97474 | 21.25391 | 21.16108 | 21.16152 | 0.645250 | 0.078875 | 0.052745 | 0.067674 | 0.213721 |
Last rows
| df_index | ID | u | g | r | i | z | uErr | gErr | rErr | iErr | zErr | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2771087 | 2771149 | 1237680100240786188 | 23.05869 | 22.36168 | 21.53303 | 21.11670 | 20.31947 | 0.601664 | 0.136826 | 0.119876 | 0.113521 | 0.223924 |
| 2771088 | 2771150 | 1237662302978114005 | 21.71403 | 21.37239 | 20.93563 | 20.71536 | 20.44528 | 0.125691 | 0.041273 | 0.040624 | 0.046867 | 0.112831 |
| 2771089 | 2771151 | 1237658611977028500 | 24.82571 | 23.13886 | 21.54026 | 20.30895 | 19.37826 | 2.869478 | 0.584235 | 0.202527 | 0.099133 | 0.153437 |
| 2771090 | 2771152 | 1237665584327098663 | 19.93252 | 19.61594 | 19.43063 | 19.45920 | 19.33438 | 0.039515 | 0.013005 | 0.013562 | 0.019402 | 0.055167 |
| 2771091 | 2771153 | 1237678790817284224 | 20.07754 | 19.66287 | 19.60898 | 19.63647 | 20.09604 | 0.055920 | 0.014554 | 0.019643 | 0.027199 | 0.162892 |
| 2771092 | 2771154 | 1237669701051154669 | 21.69291 | 21.80268 | 21.37547 | 21.36539 | 21.50789 | 0.205582 | 0.091514 | 0.109365 | 0.164178 | 0.592550 |
| 2771093 | 2771155 | 1237651535483830513 | 21.12952 | 21.31596 | 21.49134 | 21.18736 | 20.98013 | 0.102443 | 0.056157 | 0.089118 | 0.101749 | 0.310130 |
| 2771094 | 2771156 | 1237680311229809450 | 22.07685 | 21.79657 | 21.48659 | 21.40146 | 20.84215 | 0.232213 | 0.056864 | 0.076320 | 0.100319 | 0.263959 |
| 2771095 | 2771157 | 1237661463836229952 | 21.26601 | 20.75931 | 20.57111 | 20.42262 | 20.52666 | 0.085725 | 0.024749 | 0.028326 | 0.033730 | 0.121159 |
| 2771096 | 2771158 | 1237649920043581852 | 22.59109 | 21.27139 | 20.82199 | 20.61607 | 20.61585 | 0.305951 | 0.048192 | 0.045708 | 0.055666 | 0.222432 |